# GUI Visual Positioning

GUI-Actor-7B-Qwen2-VL

GUI-Actor-7B is a vision-language model built on Qwen2-VL-7B-Instruct. It targets graphical user interface (GUI) agent tasks and performs visual grounding without generating coordinates.

Multimodal Fusion

UGround is a powerful GUI visual positioning model trained with a simple recipe, developed by the Ohio State University NLP Group (OSUNLP) in collaboration with Orby AI.

Multimodal Fusion

Transformers, English

© 2025 AIbase